Trends That Affect Temporal Analysis Using SourceForge Data

نویسندگان

  • Alexander C. MacLean
  • Landon J. Pratt
  • Jonathan L. Krein
  • Charles D. Knutson
چکیده

SourceForge is a valuable source of software artifact data for researchers who study project evolution and developer behavior. However, the data exhibit patterns that may bias temporal analyses. Most notable are cliff walls in project source code repository timelines, which indicate large commits that are out of character for the given project. These cliff walls often hide significant periods of development and developer collaboration—a threat to studies that rely on SourceForge repository data. We demonstrate how to identify these cliff walls, discuss reasons for their appearance, and propose preliminary measures for mitigating their effects in evolution-oriented studies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quantitative Analysis of Open Source Projects on SourceForge

Relatively easy accessibility of high volumes of information about open source software makes it an interesting target for quantitative analysis meant to discover some hidden properties and trends of this software development model. In this work we demonstrate how such information can be acquired from the largest open source hosting facility — SourceForge — with nearly minimal effort. We compar...

متن کامل

Threats to Validity in Analysis of Language Fragmentation on SourceForge Data

Reaching general conclusions through analysis of SourceForge data is difficult and error prone. Several factors conspire to produce data that is sparse, biased, masked, and ambiguous. We explore these factors and the negative effect that they had on the results of “Impact of Programming Language Fragmentation on Developer Productivity: a SourceForge Empirical Study.” In addition, we question th...

متن کامل

Precipitation Trends Analysis in Southwest Asia during the Last Half Century

Precipitation is a climatic elements that have temporal - spatial distribution. In this research database of Global Precipitation Climatology Centre (GPCC) with a resolution 0.5×0.5 degree for 50 year is used, that was constituted with dimensions of 12800*600. Temporal data are on the columns and pixels (spatial data) located on the rows. The results show an increasing trend in spring and fall ...

متن کامل

Revealing the impact of changing land use of the annual spatiotemporal boundary layer height (Kermanshah Case Study)

Introduction Atmospheric boundary layer (ABL), is the lowest part of the atmosphere. Its behavior is directly influenced by its contact with earth surface. On earth it usually responds to changes in surface radiative forcing in an hour or less. In this layer physical quantities such as flow velocity, temperature, moisture, etc., display rapid fluctuations (turbulence) and vertical mixing is st...

متن کامل

Programming Language Trends in Open Source Development: An Evaluation Using Data from All Production Phase SourceForge Projects

In this work, we analyze data collected from the CVS repositories of 9,997 Open Source projects hosted on SourceForge in an effort to understand trends in programming language usage in the Open Source community between 2000 and 2005. The trends we consider include: 1) the relative popularity of the ten most popular programming languages over time, 2) the use of multiple programming languages by...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010